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1.  Background 


An  unmanned  ground  vehicle  (UGV)  capable  of  autonomous  mobility  is  typically  equipped  with  a 
navigation  system  for  sensing  the  UGY’s  position  in  the  world  and  a  sensor  for  detecting  obstacles. 
An  obstacle-detecting  sensor  based  on  ladar  (often  interpreted  as  a  portmanteau  of  laser  radar  and 
used  interchangeably  with  lidar)  is  frequently  used.  Returns  from  the  ladar  sensor  classify  points 
of  the  terrain  as  “occupied”  (presumably  ground  or  an  obstacle);  volumes  from  which  no  returns 
are  received  are  “free  space”  or  unknown.  This  sensor  information  enables  the  construction  of  a 
model  of  the  world  (the  world  map)  used  by  the  UGV  to  plan  its  route  to  the  goal  location,  driving 
on  terrain  known  to  be  "ground"  (occupied  and  horizontal)  and  avoiding  obstacles  detected  in  its 
path.  A  world  map  is  frequently  discretized  at  a  resolution  suitable  for  navigation,  and  the 
presence  or  absence  of  ladar  returns  is  abstracted  to  indicate  the  binary  state  of  the  nearby 
enclosing  volume.  The  resulting  world  map  may  be  no  more  than  a  two-dimensional  (2-D) 
grid  classifying  each  grid  square  as  “passable”  or  “obstacle,”  or  a  (so-called)  two-and-a-half 
dimensional  grid  representing  the  elevation  of  the  grid  square. 


2.  Problem  Statement 


In  this  study,  the  UGV  obstacle-detecting  sensor  is  used  to  construct  a  full  three-dimensional  (3-D), 
although  simplified,  world  model  during  a  UGV  patrol  through  an  area.  Then,  in  a  second  patrol 
through  the  same  terrain,  the  UGV  contrasts  fresh  sensor  data  against  the  world  model  created 
during  the  first  patrol  to  judge  what  elements  of  the  terrain  have  changed  from  free  space  to  occu¬ 
pied  space  or  vice  versa.  It  is  assumed  that  the  terrain  is  static  during  each  patrol  and  that  the  only 
changes  of  interest  occur  between  patrols.  It  is  recognized  that  these  assumptions  are  unlikely  to 
hold  in  any  kind  of  realistic  scenario.  The  study  is  intended  as  a  precursor  to  a  method  for  recog¬ 
nizing  a  moving  obstacle,  particularly  a  moving  pedestrian,  during  a  patrol  with  the  use  of  the 
sensors  native  to  an  autonomous  UGV. 

2.1  Imaging  Ladar  Technology 

The  ladar  sensor  is  favored  by  the  UGV  community  since  it  implicitly  senses  the  range  from  the 
UGV  to  an  obstacle  in  a  specific  direction  (unlike  a  video  sensor  which  senses  the  color  in  a 
specific  direction,  or  a  sonar  sensor  which  senses  the  range  in  a  general  direction).  Ladar  works 
on  the  principle  of  “time  of  flight”  of  a  laser  pulse  to  something  in  its  path,  which  can  readily  be 
converted  to  a  distance  or  range.  By  combining  the  range  and  direction  information  from  the 
ladar  sensor  with  the  UGV  position  from  the  navigation  system,  one  can  build  a  map  of  the 
world  in  a  time-invariant  coordinate  system  that  allows  sensor  information  to  be  accumulated 
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and  integrated  over  time.  The  detailed  phenomenology  of  ladar  sensing  is  beyond  the  scope  of 
this  report. 

The  imaging  ladar  used  in  this  study  collects  data,  not  as  a  single  time-of- flight  measurement  but  in 
a  temporal  stream  of  such  measurements.  Facets  of  a  spinning  mirrored  prism  sweep  a  bank  of 
pulsed  laser  beams  across  the  terrain,  as  in  figure  1 .  Each  facet  reflects  the  beam  at  a  characteristic 
angle  (A)  with  respect  to  axis  of  spin  and  traces  a  partial  conic  surface  through  space.  A  laser  pulse 
synchronized  with  rotational  displacement  ( T)  of  the  mirror  results  in  a  reflected  return  which  is 
sensed.  The  time  of  flight  between  pulse  and  return  is  converted  to  a  distance  that  corresponds  to 
the  range  ( D )  to  a  point  on  the  cone.  This  range  measurement  in  the  context  of  its  direction  in  A 
and  T  is  termed  a  “ ranger  (from  range  pixel).  Geometry  converts  the  point  at  D,  T,  A  to  a  point  at 
Cartesian  X,  Y,  Z  in  sensor  coordinate  frame.  A  scan  is  the  set  of  all  such  points  from  all  facets  as 
the  spinning  prism  completes  one  rotation.1  Each  point  measured  is  effectively  the  output  of  a 
line-of-sight  sensor.  The  scan  thus  samples  a  surface  bounding  the  line  of  sight  from  the  sensor  to 
the  surrounding  terrain. 


1  If  the  facets  are  designed  so  that  the  swept  volumes  align  and  adjoin,  a  scan  so  created  can  be  represented  as  a 
range  image,  a  pixel-like  matrix  of  range  values  mapping  successive  values  of  A  and  T  into  indices  of  image  row 
and  column.  This  can  be  useful  for  visualization  by  assigning  false  color  to  the  range  value  D.  However,  the 
“rows”  swept  by  pulses  reflected  from  a  facet  are  not  lines  but  arcs,  so  an  image  constructed  along  linear  rows  and 
columns,  as  is  conventional,  appears  distorted. 
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A  single  scan  from  this  ladar  is  of  limited  resolution,  but  scans  occur  at  a  rate  comparable  to 
video  frame  rate.  When  mounted  on  a  moving  vehicle,  the  ladar  senses  the  terrain  frequently  and 
from  vantage  points  that  change  as  the  vehicle  progresses  along  its  path,  effectively  sampling  the 
terrain  at  a  spatial  frequency  higher  than  the  raw  sensor  resolution  would  indicate.  Sensor  data 
from  a  stream  of  scans  can  be  combined  with  sensed  vehicle  location  information  from  the 
UGV’s  navigation  system  and  converted  to  a  common  (world)  coordinate  system,  allowing  the 
aggregation  of  terrain  information.  The  aggregated  sensor  information  is  dense  but  distributed  in 
an  irregular  fashion  across  the  terrain.  An  example  of  the  effect  of  this  aggregation  is  shown  in 
figure  2. 


Figure  2.  Left  figure  is  a  single  scan  of  intersection  of  two  walls  at  ~15  m;  right  figure  shows  several  hundred  scans 
of  a  human  in  woods  at  ~15  m,  colored  by  elevation. 


Data  for  this  study  were  generated  by  an  imaging  ladar  built  by  General  Dynamics  Robotic 
Systems  (GDRS).  The  GDRS  ladar  has  been  described  in  Shneier  et  al.  (2003).  Other  imaging 
ladar  technologies  exist;  several  are  described  in  Juberts  and  Barbera  (2004). 


2.2  Spatial  Resolution  and  Voxels 

In  practical  applications  of  terrain  change  detection,  there  is  a  lower  limit  to  the  magnitude  of 
change,  which  is  of  interest.  This  limit  is  dictated  by  the  application  and  precision  of  the  sensor. 
In  this  study,  we  are  uninterested  in  any  change  smaller  than  the  smallest  dimension  of  a  human 
torso  (roughly  0.2  meter),  and  the  precision  of  the  sensor  is  believed  to  be  several  centimeters. 
Therefore,  we  can  simplify  the  problem  and  reduce  the  amount  of  data  to  be  accumulated  by 
mapping  the  data  into  a  grid  with  resolution  appropriate  to  the  application.  In  this  case,  we 
define  a  3-D  matrix  on  0.1 -meter  centers.  This  grid  is  said  to  be  made  of  voxels,  “a  portmanteau 
of  the  words  volumetric  and  pixel”2.  Its  resolution  was  chosen  to  avoid  the  aliasing  that  results 
from  sampling  at  a  interval  greater  than  the  (spatial  analog  of)  Nyquist  “rate”  and  is  believed  to 
be  approximately  the  correct  size  for  this  study. 


2.3  Evidence  Grid 

As  observed  previously,  the  sensor  samples  the  terrain  in  an  irregular  pattern  governed  by  the 
path  traversed  by  the  sensor.  Comparison  of  two  patrols  on  a  point-by-point  basis  is  thus  not 


2http://en.  wikipedia.org/wiki/Voxel 
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possible,  and  even  comparison  on  a  voxel-by- voxel  basis  requires  some  sort  of  normalization  to 
compensate  for  differences  in  the  actual  sampling  realized.  One  method  that  has  proved  useful 
in  similar  problems  is  the  evidence  grid.  In  this  scheme,  grid  cells  are  not  simply  classified  as 
occupied  or  unoccupied  but  instead  store  the  evidence  (certainty)  that  the  cell  is  occupied  or 
empty.  Martin  and  Moravec  (1996)  describe  several  variants  of  evidence  grids  and  an  under¬ 
lying  framework  grounded  in  probability  theory.  The  approach  of  this  study  is  similar  to  an 
early  version  of  Moravec's  work,  without  theoretical  framework.  It  is  described  in  detail  in 
section  3. 

2.4  Change  Detection 

Change  detection  in  3-D  sensor  data  has  infrequently  been  reported  to  have  been  applied  to 
UGV  sensor  data.  An  exception  (without  details)  is  Pacis  et  al.  (2005).  Schutz  and  Hugh  (1996) 
described  change  detection  in  toy  scenes  using  data  from  an  optical  striper.  Processing  occurred 
in  the  range  image  domain  rather  than  the  3-D  domain,  and  the  focus  was  occlusion.  Change 
detection  techniques  have  been  applied  to  elevation  maps  or  classification  maps  constructed  from 
3-D  ladar  or  synthetic  aperature  radar  (SAR)  (e.g.,  Tuong  et  al.,  2004).  Change  detection  in  3-D 
medical  images  is  more  common  (e.g.,  Maintz  and  Viergever,  1998).  Three-dimensional  data 
from  ladar  and  SAR  are  more  typically  applied  to  target  acquisition  problems,  which  are  related 
but  different  (Bhanu  et  al.,  1997). 

In  2-D  applications,  typically  based  on  video  imagery,  change  detection  is  a  common  element  in 
a  host  of  applications,  notably  video  surveillance  and  remote  sensing.  An  excellent  survey  is 
presented  in  Radke  et  al.  (2005). 


3.  Approach 


The  approach  used  in  this  study  is  based  on  a  voxel-based  world  model.  This  representation 
enables  volumes  that  correspond  spatially  to  be  compared  between  patrols.  We  endeavor  to 
classify  voxels  as  “changed”  or  “not  changed,”  based  on  a  specified  minimum  change  between 
patrols  in  the  evidence  of  a  voxel  being  occupied.  Evidence  of  a  voxel  being  occupied  during  a 
patrol  is  measured  by  the  number  of  times  it  was  discovered  to  be  occupied,  normalized  by  the 
number  of  times  the  voxel  has  been  encountered.  The  process  by  which  this  classification  occurs 
is  described  further.  Presentation  graphics  illustrating  the  process  are  shown  in  appendix  A.  The 
following  text  tracks  these  figures  and  references  figures  as  appropriate.  The  reader  may  find  it 
helpful  to  read  the  two  in  tandem. 

Sensor  data  from  the  first  patrol  is  parsed  so  that  each  data  point,  in  world  XYZ  coordinates,  is 
paired  with  the  spatial  location  of  the  sensor  at  the  time  of  the  sense.  These  are  termed  the  point 
and  the  viewpoint  of  the  data  pair.  Each  pair  thus  defined  a  line  segment,  termed  (following 
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conventions  of  the  computer  graphics  community)  a  ray.  This  is  depicted  in  figures  A-l  and 
A-2. 


The  volume  spanned  by  the  sensor  data  from  the  first  patrol  is  divided  into  cubic  voxels  at  a 
resolution  of  0. 1  meter,  as  in  figure  A-3.  Each  voxel  is  assigned  a  set  of  attributes,  defined  in 
table  1 .  The  attributes  occ,  no_occ,  p_occ,  and  diff  are  dynamic  and  change  as  we  collect  more 
sensor  information.  The  attribute  p_l  is  aggregate  and  assumes  a  value  only  at  the  conclusion  of 
processing  sensor  data  from  the  first  patrol.  During  processing  of  data  from  the  first  patrol,  p_l 
and  diff  are  undefined. 

Table  1.  The  attributes  of  a  voxel. 


OCC 

The  number  of  times  a  voxel  was  discovered  to  be  occupied  during  this  patrol 

no  occ 

The  number  of  times  a  voxel  was  discovered  to  be  empty  during  this  patrol 

p  occ 

The  probability  that  a  voxel  is  occupied 

pi 

The  probability  that  a  voxel  was  occupied  during  patrol  1 

diff 

The  probability  the  voxel  has  changed  since  patrol  1 

With  a  technique  from  computer  graphics  known  as  ray  tracing  (Amanatides  and  Woo,  1987),  the 
voxels  encountered  by  the  ray  are  identified.  The  voxel  corresponding  to  the  terminal  point  of  the 
ray  increments  its  occ  attribute,  indicating  an  increase  in  evidence  that  the  voxel  is  occupied.  Any 
voxel  completely  traversed  by  the  ray  increments  its  no_occ  attribute,  indicating  an  increase  in 
evidence  that  the  voxel  is  free  space  and  not  occupied.  This  is  depicted  in  figures  A-4  and  A-5. 

When  all  data  from  the  patrol  are  processed,  the  ratio  p_occ,  representing  a  measure  of  the  evidence 
that  the  voxel  is  occupied  based  on  current  evidence  accumulated,  is  calculated  for  each  voxel,  from 

p_occ  =  occ / (occ  +  noocc) 

With  all  evidence  from  the  first  patrol  having  been  processed,  each  voxel's  p_occ  is  assigned  to  its 
p_l  attribute,  representing  the  evidence  that  the  voxel  was  occupied  during  the  first  patrol,  as  in 
figure  A-6.  At  this  point,  processing  of  the  first  patrol  is  complete,  and  we  begin  processing  data 
from  the  second  patrol.  Figure  A-7  addresses  the  transition. 

Data  from  the  second  patrol  is  loaded  and  parsed  into  rays;  occ  and  no_occ  attributes  are  cleared  to 
initialize  the  processing  of  the  second  patrol.  As  in  the  processing  of  the  first  patrol,  the  rays  from 
the  second  patrol  are  traced,  occ  and  no_occ  incremented  for  all  affected  voxels,  figures  A-8  and 
A-9. 

To  identify  change  voxels,  the  ratio  p_occ  is  calculated  for  each  voxel  and  compared  to  the  corre¬ 
sponding  p_l.  The  difference  diff  between  the  voxel  p_occ  at  the  time  of  the  event  and  its  p_l 
from  the  first  patrol  is  calculated  as 

diff  =  abs(p_occ  -  p_l) 
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and  the  diff  values  are  threshholded  to  differentiate  change  voxels  (voxels  with  diff  >  v_threshhold) 
from  background  voxels  (voxels  with  diff  <v_threshhold).  This  is  depicted  in  figures  A- 10  and 
A-ll. 

To  reduce  outliers,  change  voxels  are  aggregated  into  change  regions,  as  in  figure  A-12.  Any 
change  voxel  within  a  distance  threshhold  r_threshhold  of  the  centroid  of  an  existing  change 
region  is  assigned  to  the  region,  and  the  centroid  is  recomputed.  In  a  cleanup  step,  overlapping 
pairs  of  change  regions  are  absorbed  by  the  larger  of  the  pair. 


4.  Method 


This  study  is  intended  to  evaluate  the  effectiveness  of  the  change  detection  algorithm  described 
in  the  previous  section.  Two  sets  of  data  were  collected3  at  Fort  Indiantown  Gap  (FITG), 
Pennsylvania,  robotics  research  facility.  Each  set  consisted  of  a  base  run,  in  which  the  UGV 
traversed  a  path  through  natural  terrain,  and  several  variant  runs.  The  variant  runs  traversed  the 
same  terrain  along  a  similar  path,  but  in  each  variant,  one  or  more  intentional  changes  were 
introduced.  Data  sets  from  patrols  were  recorded  and  the  change  detection  algorithm  was 
applied  offline.  Detection  of  the  intentional  change  was  the  goal  of  the  exercise. 


5.  Results 


The  first  data  set  was  a  deciduous  wooded  terrain  in  November,  and  the  intentional  change  was  the 
introduction  of  a  person.  Several  variant  runs  were  conducted,  but  the  navigation  system  drifted 
between  runs  so  that  misalignment  of  the  data  between  runs  was  as  great  as  the  intentional 
change.4  Of  this  set,  only  one  variant  was  deemed  sufficiently  aligned  to  evaluate.  The  patrols 
of  this  data  set  are  FITG  143  and  FITG  144.  With  a  naming  convention,  the  comparison  is  named 
FITG  143-144. 

The  second  data  set  was  a  dirt  road  between  two  scrubby  deciduous  tree  lines  in  May.  The  base 
run  is  FITG  129,  and  the  intentional  change  was  the  introduction  of  a  person  in  one  variant 
(FITG  130)  and  the  introduction  of  a  person  and  a  small  lunch  box  roughly  a  0.2-meter  cube 
(FITG  131).  These  runs  were  very  well  aligned. 


3Thanks  to  Dr.  Nicolas  Vandapel  and  Dr.  Martial  Hebert,  of  Carnegie  Mellon  University,  and  the  FITG  staff  of 
GDRS. 

4In  principle,  the  data  could  have  been  aligned,  perhaps  by  a  method  such  as  iterative  closest  point,  but  the 
alignment  was  deemed  beyond  the  scope  of  this  study. 
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In  neither  data  set  was  ground  truth  collected  rigorously,  but  the  person-size  changes  were 
evident  in  the  raw  data  as  visualized  in  a  VRML  viewer5  (see  figure  2b).  The  location  of  the 
lunch  box  was  known  approximately,  and  it,  too,  was  roughly  visible  in  the  source  data. 

The  primary  measure  of  merit  was  the  effectiveness  of  the  algorithm  in  detecting  the  region  of 
intentional  change  while  not  detecting  change  in  other  regions  of  the  scene,  assumed  to  be 
ephemeral  noise.  A  second  measure  of  interest  is  the  distance  at  which  a  change  region  was 
detected  and  the  persistence  of  the  detection.  To  track  this  measure,  change  detection  was 
evaluated  at  events  scheduled  at  intervals  of  30,000  data  points  (equivalent  to  approximately 
0.5  second  of  data  from  the  sensor)  throughout  processing  of  sensor  data  from  the  second  patrol.6 
At  each  scheduled  event,  change  voxels  were  extracted  from  the  volume  and  change  regions 
assessed.  Change  regions  were  then  matched  to  persistent  regions  from  the  previous  evaluation 
event,  again  based  on  spatial  proximity.  The  results  were  projected  to  a  virtual  image  at  the 
instantaneous  vantage  point  of  the  vehicle,  and  false  color  corresponding  to  the  degree  of  change 
assigned  to  each  non-free-space  voxel  was  assigned.  Presented  as  a  movie,  the  3-D  nature  of  the 
terrain  is  quite  apparent,  but  for  the  purpose  of  this  report,  a  static  image  must  suffice.  Change 
regions  in  the  image  were  surrounded  by  red  boxes.  The  distance  from  the  sensor  location  to  the 
change  region  centroid  was  computed  and  the  number  of  change  voxels  comprising  the  change 
region  recorded.  At  the  end  of  evaluation  of  the  second  patrol,  the  trajectories  of  change  region 
size  and  distance  from  the  sensor  were  plotted. 

5.1  FITG 143-144 

In  the  virtual  image  of  the  difference  between  runs  143  and  144,  red  boxes  representing  two 
human-size  targets  can  be  seen  among  the  trees,  as  well  as  a  small  red  box  in  the  tree  branches. 

In  the  accompanying  plot,  the  trail  of  red  x’s  beginning  at  abscissa  value  13,  read  from  the  left- 
hand  ordinate  axis,  tells  us  that  the  first  intentional  human-size  change  (reg  1)  is  detected  first  at 
a  range  of  16  meters.  The  same  color  solid  line,  read  from  the  right-hand  ordinate  axis,  tells  us 
that  reg  1  initially  contains  10  change  points.  Following  the  trail  of  red  markers,  we  can  see  that 
reg  1  is  tracked  throughout  the  patrol,  increasing  in  size  (number  of  detected  change  voxels)  as 
more  sensor  data  become  available.  The  second  intentional  human-size  change  (reg  2,  in 
magenta)  is  detected  at  16  meters’  distance,  lost  momentarily,  and  reacquired  at  15  meters  as 
reg  3,  in  black.  At  1 1  meters’  distance,  the  small  red  box  in  the  tree  branches  is  acquired  as  reg  4 
and  tracked  to  the  end  of  the  patrol.  There  was  no  intentional  change  at  this  position,  so  the 
change  must  be  considered  a  false  positive  (though  it  could  have  been  a  bird  or  squirrel 
unnoticed  by  the  researchers). 


5VRML  (Virtual  Reality  Modeling  Language)  is  a  standard  file  format  for  describing  virtual  3-D  worlds.  Points 
collected  by  the  ladar  sensor  were  formated  for  display  by  an  interactive  VRML  viewer,  allowing  the  3-D  structure 
of  the  data  to  be  visualized. 

6It  would  have  been  more  intuitive  to  schedule  the  events  at  time-based  intervals,  but  the  time  of  acquisition  was 
not  available  in  the  processed  sensor  data. 
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5.2  FITG 129-130 


In  the  data  set,  a  small  lunch  box  was  placed  along  the  wooded  dirt  road.  The  small  red  box  indi¬ 
cating  a  change  region  is  apparent  in  the  virtual  image,  and  the  graph  indicates  that  the  detection 
was  made  at  a  range  of  more  than  13  meters.  However,  after  the  small  item  was  tracked  for 
roughly  6  meters,  the  change  region  diminished  in  size  and  disappeared  since  the  algorithm 
concluded  that,  based  on  the  accumulated  evidence,  there  was  not  a  change  after  all.  (The  patrol 
continued,  although  the  plot  ended  with  no  change  regions  to  report.)  This  represents  a  partial 
success  since  the  change  region  detected  should  have  persisted. 


5.3  FITG  129-131 


In  this  data  set,  the  small  lunch  box  (reg  1)  was  detected  at  a  range  of  1 1  meters  and  is  depicted 
at  the  left  of  the  upper  virtual  image.  It  was  tracked  successfully  as  a  region  of  change  of 
apparent  size  4  voxels  until  the  end  of  the  patrol,  although  no  new  data  were  available  past  scan 
sequence  40  or  thereabouts  since  the  lunch  box  was  no  longer  in  the  field  of  view  of  the  sensor. 
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Changes  detected  between  runs  129-131 ,  FUG 


This  sequence  is  represented  by 
two  virtual  images  since  no  single 
image  contained  all  the  detected 
change  regions.  The  upper  image 
corresponds  to  scan  sequence 
(abscissa  value)  35  and  the  lower 
image  to  scan  sequence  75. 


Also  apparent  in  the  upper  virtual  image  is  a  small  red  box  designated  by  the  white  arrow.  This  is 
reg  2,  which  is  lost  momentarily  and  reacquired  as  reg  3.  This  region  is  a  false  positive  resulting 
from  the  effect  of  occlusion.  The  object  causing  the  detection  was  viewed  from  the  front  and  side 
during  the  traverse  of  patrol  129,  but  during  the  early  portion  of  patrol  131,  only  the  front  could  be 
seen.  As  the  vehicle  drew  near,  the  side  became  increasingly  visible  and  the  difference  in  appear¬ 
ance  fell  below  the  change  threshhold,  so  the  algorithm  ceased  to  declare  the  voxels  changed.  It 
was  identified  as  a  false  positive  for  5  meters  from  the  total  patrol  of  ~40  meters,  with  a  maximum 
apparent  size  of  7  voxels. 

Visible  in  the  lower  virtual  image  is  reg  4,  a  human-size  region  of  intentional  change.  It  is 
detected  at  a  range  of  17  meters  and  tracked  successfully  to  the  end  of  the  patrol. 


9 


6.  Discussion 


6.1  Distribution  of  Voxelized  Data 

During  ray  tracing,  when  a  ray  encounters  a  voxel,  it  passes  through  the  voxel  (evidence  that  the 
voxel  is  unoccupied)  or  it  terminates  (evidence  that  the  voxel  is  occupied  by  something).  During 
a  run,  a  voxel  will  likely  be  encountered  by  dozens  to  thousands  of  rays.  P(occ)  is  a  measure  of 
the  proportion  of  incident  rays  that  are  reflected,  which  is  aggregated  evidence  of  the  likelihood 
that  the  voxel  is  occupied.  Figure  3  plots  p_l  at  the  end  of  the  first  patrol  for  one  of  the  patrol 
pairs  and  is  not  dissimilar  from  plots  for  the  others.  Only  those  voxels  with  P(occ)  >0.1  are 
plotted;  voxels  encountered  with  P(occ)  <  0. 1  (clearly  tree-space  voxels)  are  not  shown.  In  the 
data  sets  of  this  study,  clearly  free-space  voxels  comprised  about  80%  of  voxels  encountered. 

Although  not  apparent  in  the  plots,  it  was  noted  that  the  voxels  with  intermediate  values  of 
P(occ)  tended  to  cluster  around  voxels  with  high  values  of  P(occ).  This  can  be  viewed  as  a 
spatial  blurring  of  the  voxel  content  over  nearby  voxels. 

Cumulative  distribution  of  P(occ) 
for  non-free-space  voxels 


Figure  3.  Cumulative  distribution  of  p_l  for  patrol  143. 

The  thing  that  makes  change  detection  difficult  is  that  the  evidence  grid  method  does  not  classify 
the  volume  clearly  as  occupied  or  unoccupied.  Some  of  the  effects  that  confuse  the  measure  are 
now  described. 
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6.2  Vantage  Point 

As  the  vehicle  moves  along  its  path,  the  terrain  sampled  by  the  sensor  is  determined  by  the 
vantage  point  of  the  sensor  in  full  6  degrees  of  freedom.  The  sampling  is  thus  determined  in  part 
by  the  path  of  the  vehicle  through  the  scene,  which  is  not  tightly  specified  in  the  scenario.  The 
aggregated  sensor  data  are  also  affected  by  time  derivatives  of  the  vehicle  path.  If  the  vehicle 
travels  more  slowly,  the  terrain  is  sampled  at  a  relatively  higher  spatial  rates.  In  this  study,  the 
sensor  was  mounted  on  a  pan/tilt  device,7  adding  to  the  complexity  of  the  sampling. 

6.3  Occlusion 

Occlusion  describes  the  obstruction  of  the  line  of  sight  to  some  object  by  some  other,  nearer 
object,  resulting  in  the  more  distant  object  not  being  seen  in  its  totality.  The  partially  obstructed 
object  is  described  as  being  occluded  by  the  nearer  object.  In  the  context  of  this  study,  as  the 
vehicle  moves  along  its  path,  some  features  previously  occluded  may  become  visible,  and  vice 
versa,  that  is,  the  appearance  of  the  scene  changes. 

6.4  Surface  Versus  Volume 

Representing  the  data  as  XYZ  data  in  a  common  reference  frame,  as  required  to  integrate  terrain 
information  as  the  sensor  moves,  implicitly  treats  the  world  as  a  volume.  However,  the  aggregated 
sensor  data  describe  only  a  set  of  surfaces  visible  from  the  path  described  by  the  sensor,  which  is 
limited  to  observations  in  the  instantaneous  line  of  sight.  Not  only  are  some  terrain  features 
occluded  by  others,  but  elements  of  terrain  features  are  self-occluded  (for  example,  the  far  side 
of  a  tree  trunk  is  occluded  by  the  near  side  of  the  same  trunk). 

6.5  Range  Effects 

The  volume  swept  by  each  rangel  measurement  is  effectively  in  spherical  coordinates,  while  the 
voxels  into  which  the  world  model  is  subdivided  are  Cartesian.  Voxels  near  the  vehicle  may  be 
oversampled  by  a  scan,  while  those  farther  away  are  undersampled.  The  ladar  of  this  study 
samples  at  roughly  0.5-degree  intervals  in  both  horizontal  and  vertical  directions,  so  a  facet  of  a 
voxel  corresponds  to  the  cross  section  of  the  rangel  at  a  range  of  approximately  12  meters. 

In  addition  to  effects  that  blur  P(occ)  of  a  voxel  during  a  single  patrol,  two  additional  effects 
impact  the  difference  diff  between  P(occ)  for  two  different  patrols. 

6.6  Path  Dependence 

While  the  pairs  of  patrols  traverse  the  same  terrain,  the  path  followed  by  the  UGV  in  each  patrol 
is  not  closely  controlled  and  may  diverge  somewhat — in  this  study,  by  a  couple  of  meters.  The 
vantage  point  therefore  changes  not  only  within  a  patrol  but  between  patrols  as  well. 


7Motion  of  the  pan/tilt  device  was  instrumented.  The  effect  of  this  motion  was  factored  into  the  mapping  of 
sensor  points  into  the  world  coordinate  system. 
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6.7  Registration 

Since  all  sensor  data  in  this  scheme  are  converted  to  a  common  world  coordinate  system,  it  is 
important  that  the  coordinate  system  be  stable.  The  key  element  in  maintaining  the  world 
coordinate  system  is  the  navigation  system  of  the  moving  vehicle.  Real-world  navigation  systems 
experience  error  that  varies  with  time  {drift),  and  in  simple  navigation  systems,  that  error  can 
become  quite  large.  Sensor  data  assigned  to  the  wrong  voxel  because  the  navigation  system  is  in 
error  will  be  compared  to  sensor  data  from  a  completely  different  terrain,  potentially  leading  to 
nonsensical  results.  Errors  in  registration  caused  several  data  sets  from  this  study  to  be  discarded. 

6.8  Distribution  of  the  Difference  in  Voxel  Content 

Figure  4  plots  the  cumulative  distribution  of  the  diff  attribute  at  the  end  of  one  of  the  patrol  pairs. 

As  in  figure  3,  voxels  that  were  discovered  to  be  clearly  free  space  are  not  plotted.  The  value 
plotted  represents  the  difference,  on  a  voxel-by-voxel  basis,  between  the  P(occ)  from  the  first  patrol 
and  the  P(occ)  from  the  (entire)  second  patrol.  Since  the  terrain  is  assumed  to  change  very  little 
except  for  the  intentional  changes  introduced,  we  expect  the  measures  of  P(occ)  from  the  two 
patrols  to  be  highly  correlated  and  the  measure  of  the  difference  between  P(occ)  for  the  same  voxel 
to  be  small.  Although  changes  can  occur  because  of  the  changes  described,  a  large  change  is 
unlikely  unless  there  has  truly  been  a  change  in  the  voxel  content.  The  threshhold  v_threshhold  is 
set  accordingly  high.  A  value  of  0.8  was  found  to  be  effective  in  selecting  voxels  of  known  change. 


Cumulative  distribution  of  change  in  P(occ) 


%  non-free-space  voxels  with  change  in  P(occ)  =<  ordinate  value 


Figure  4.  Cumulative  distribution  of  diff  between  patrol  143  and  144.  (Non-free- 
space  voxels  not  plotted.) 
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7.  Conclusions 


This  study  was  undertaken  to  establish  a  basis  for  detecting  change  in  3-D  data  from  a  moving 
imaging  ladar.  The  three  pairs  of  patrols  described  in  this  report  are  insufficient  to  draw  any  firm 
conclusions  concerning  the  effectiveness  of  the  approach  in  detecting  changes  between  patrols. 
Further  study  of  parameters  used  in  voxelization,  threshholding,  and  region  growing  would  be 
required,  as  would  more  data  from  terrain  differing  in  content  and  tools  to  correct  errors  in 
registration  between  patrols.  The  results  achieved  in  the  study  show  promise  for  restricted 
circumstances  but  seem  unlikely  to  be  useful  in  a  real-world  application. 

As  a  precursor  to  the  intended  study  of  detecting  moving  objects  (e.g.,  pedestrians)  from  a  moving 
platform,  however,  the  results  indicate  a  promising  direction.  The  approach  used  here,  that  is, 
operating  on  voxelized  and  normalized  sensor  data,  rather  than  the  raw  data  itself,  is  effective  in 
identifying  regions  of  change,  at  least  in  the  circumstances  of  this  study.  The  approach  of  tracing 
line  segments  (rays)  emanating  from  the  sensor  position  and  terminating  at  the  XYZ  position  of  the 
return  appears  a  useful  counterpoint  to  the  location  of  the  return  in  defining  an  evidence-based 
measure  of  voxel  content.  These  concepts  can  be  carried  over  to  an  ensuing  study.  In  addition, 
it  is  anticipated  that  the  ad  hoc  approach  implemented  here  can  be  improved  if  a  more  rigorous 
probabilistic  framework  is  applied  to  the  voxel  measures  and  their  evaluation. 
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□  We  wish  to  fad 
changes  in  terrain 
encountered  by  a 
patrolling  UGV 
from  one  patrol  to 
another 


□  The  UG  V  is 
equipped  with  a 
scanning  ladar, 
wliich  measure?  the 
distance  to  the 
nearest  surface  at 
an  array  of  points 
spanning  a  piism- 
shaped  field  of 
new 

□  Each  scan  consists 
of  a  ] lumber  of 
discrete  laser 
returns 


Figure  A-l.  Objective:  Find  change  from  patrol  to  patrol. 


□  During  a  patrol 
the  UGV  acquires 
■d  large  number  of 
Jadar  scans 


U  Each  return  is 
converted  to  a 
line  segment 
connecting  die 
sensor  center  to 
the  point  at  which 
the  return  is 
sensed 

□  The  segment  is 
represented  as  a 
3-D  point  and  a 
3-D  viewpoint 


Figure  A-2.  Ladar  return  interpreted  as  line  segment. 


□  Space  near  die 

UGV  is  divided 
into  discrete 

cubic  volumes 
called  voxels 
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Figure  A-3.  Space  interpreted  as  voxels. 


□  Space  near  the 
UGV  is  divided 
into  a  discrete 
cubic  volumes 
called  voxels 

□  Each,  voxel 
traversed  by  the 
segment  is  noted 
as  being  free 
space  (in  blue) 

□  The  voxel  from 
wliich  the  return 
is  sensed  (the 
voxel  containing 
the  “point”)  is 
noted  as  being 
occupied  (in  red) 

□  liRay  tracing” 


Figure  A-4.  Tracing  a  “ray”. 


Appendix  A.  3-D  Change  Detection  With  the  Use  of  an  Evidence  Grid 


□  This  is  done  with 
each  paint  sensed 
by  tins  ladar  unit 

□  As  the  UGV 
moves,  it 
encounter?  eueh 
voxel  many  limes, 
accumulating 
statistical  evidence 
as  to  whether  tlie 
voxel  is  free  space 
(no_occ)  or 
occupied  (occ) 

□  The  proportion  of 
times  the  voxel  is 
seen  as  beiiij* 
occupied  is 
calculated, 

P(acc)  - 

occ  /(occ  +  na_occ) 


Figure  A-5.  Probability  that  a  voxel  is  occupied,  P(occ) 


□  Eventually  an 
entire  volume  lias 
been  seen 

□  This  is  the 
desired 

information  for 
the  end  of  first 
patrol 

□  Voxels  with  a 
high  P(oec)  ( in 
lied)  swim  in  a 
vast  sea  of  free 
space  (m  blue), 
surrounded  by  a 
shallow  layer  of 
voxels  with  ail 
intermediate 
P(oce)  (in  green) 


Figure  A-6.  Voxel  state  at  end  of  first  patrol 


□  When  we  revisit 
this  area,  we 
recall  our  a  priori 
data  from  the  first 
visit 

□  Wc  are  interested 
ui  any  changes  in 
the  terrain 
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Figure  A-7.  Ready  for  second  patrol 


Figure  A-8.  Begin  second  patrol 


□  As  in  tiie  first 
patrol,  we  make 
multiple 

observations  and 

compute  die 
P(oce)  for  each 
voxel,  using  only 
die  sensor  data 
from  this  patrol 


Figure  A-9.  Find  P(occ)  for  each  voxel  during  second  patrol 


□  We  then  compute 
llie  difference 
between  P(occ) 
for  each  of  the 
two  patrols,  for 
each  voxel 


Figure  A- 10.  Compare  P(occ)  for  the  two  patrols 


□  Indie 

representation 

labeled 

"Change”,  green 
is  small  change, 
orange 

intermediate,  and 
icd  large 

U  We  pay  special 
attention  fo 
voxels  which 
show  large 
changes  in  P(occ) 
since  the  previous 
visit. 
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Figure  A-l  1 .  Find  change  voxels 


□  Outliers  are 
culled  if  there  is 
no  other  nearby 
evidence  of 
change 

□  Attention  is  thus 
focused  on  voxels 
showing  a  high 
degree  of  change 

LI  Change  voxels 
arc  aggregated 
into  change 
regions 

□  “Region 
growing7' 


Rim  1  Run  1 


Change 


Figure  A- 12.  Find  change  regions 
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